Generating data sets for benchmarking
Abstract
A new method of benchmarking neural networks based on Voronoi diagrams is introduced. Their complexity is examined and it is shown that data sets of increasing difficulty may be generated. Experiments are conducted examining the performance of five known classification methods on examples of Voronoi data sets.
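The abstract does not spell out how the data sets are constructed, so the following is only a minimal sketch of one plausible reading, not the authors' procedure. It assumes that random sites define the Voronoi cells, that each cell receives a class label drawn uniformly at random, and that sample points take the label of the cell they fall in. The function name `make_voronoi_dataset` and all of its parameters are hypothetical. Under these assumptions, raising `n_cells` tends to produce a more fragmented decision boundary, which is one way the difficulty of a data set might be increased.

```python
import random

def make_voronoi_dataset(n_cells, n_points, n_classes=2, dim=2, seed=0):
    """Hypothetical generator: label random points by their nearest Voronoi site."""
    rng = random.Random(seed)
    # Random sites in the unit hypercube; each site defines one Voronoi cell.
    sites = [[rng.random() for _ in range(dim)] for _ in range(n_cells)]
    # Assumption: each cell's class label is drawn uniformly at random.
    site_labels = [rng.randrange(n_classes) for _ in range(n_cells)]

    points, labels = [], []
    for _ in range(n_points):
        p = [rng.random() for _ in range(dim)]
        # The nearest site (squared Euclidean distance) identifies the containing cell.
        nearest = min(
            range(n_cells),
            key=lambda i: sum((a - b) ** 2 for a, b in zip(p, sites[i])),
        )
        points.append(p)
        labels.append(site_labels[nearest])
    return sites, site_labels, points, labels

# Example: 200 two-dimensional points labelled by 8 Voronoi cells.
sites, site_labels, points, labels = make_voronoi_dataset(n_cells=8, n_points=200)
```

The returned `points` and `labels` could then be split into training and test sets and passed to any classifier under comparison.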
Similar resources
EMBench: Generating Entity-Related Benchmark Data
The entity matching task aims at identifying whether instances are referring to the same real world entity. It is considered as a fundamental task in data integration and cleaning techniques. More recently, the entity matching task has also become a vital part in techniques focusing on entity search and entity evolution. Unfortunately, the existing data sets and benchmarking systems are not abl...
Generating Benchmarks by Random Stepwise Refinement of Petri Nets
The quality of algorithms is often determined by benchmarking, i.e., testing the algorithm on a predetermined data set. In contrast to traditional benchmarking, with fixed data set, we present a way to generate random sets of test data. In this paper we present random classes of Petri nets and a method to generate finite samples from such a class. The classes may contain infinitely many Petri n...
BDGS: A Scalable Big Data Generator Suite in Big Data Benchmarking
The complexity and diversity of big data systems and their rapid evolution give rise to various new challenges about how we design benchmarks in order to test such systems efficiently and successfully. Data generation is a key issue in big data benchmarking that aims to generate application-specific data sets to meet the 4V requirements of big data (i.e. volume, velocity, variety, and veracity)...
On Benchmarking Optical Flow
Evaluating the performance of optical flow algorithms has been difficult because of the lack of ground truth data sets for complex scenes. We present a new method for generating motion fields from real sequences containing polyhedral objects and present a test suite for benchmarking optical flow algorithms consisting of complex synthetic sequences and real scenes with ground truth. We provide a...
BigOP: Generating Comprehensive Big Data Workloads as a Benchmarking Framework
Big Data is considered a proprietary asset of companies, organizations, and even nations. Turning big data into real treasure requires the support of big data systems. A variety of commercial and open source products have been unleashed for big data storage and processing. While big data users are facing the choice of which system best suits their needs, big data system developers are facing the ...
Publication date: 1995